Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature: TorchServe example #870

Closed

Conversation

JaysonAlbert
Copy link

Add a TorchServe example:

Why We Use TorchServe

TorchServe is designed to deliver high performance for serving PyTorch models, and it excels in the following key areas:

  1. Batching Requests: TorchServe automatically batches incoming requests, processing multiple predictions in parallel. This reduces overhead, improves throughput, and ensures efficient use of resources, especially when dealing with large volumes of requests.

  2. Horizontal Scaling: TorchServe allows for horizontal scaling, meaning it can easily scale across multiple machines or containers to handle increasing traffic. This ensures that the system remains responsive and can handle large volumes of inference requests without sacrificing performance.

Copy link
Member

@fumiama fumiama left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's quite independent. You can host this repo by yourself and open a PR, in which modify relevant segment of README, linking to your repo.

@fumiama fumiama added documentation Improvements or additions to documentation enhancement New feature or request question Further information is requested labels Jan 8, 2025
@JaysonAlbert
Copy link
Author

It's quite independent. You can host this repo by yourself and open a PR, in which modify relevant segment of README, linking to your repo.

Thank you for the suggestion! I've created a repo at ChatTTS-TorchServe, now i'm closing this PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request question Further information is requested
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants